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COUNTING DESIGNS 


PETER KEEVASH 


Abstract. We give estimates on the number of combinatorial designs, 
which prove (and generalise) a conjecture of Wilson from 1974 on the 
number of Steiner Triple Systems. This paper also serves as an ex¬ 
pository treatment of our recently developed method of Randomised 
Algebraic Construction: we give a simpler proof of a special case of our 
result on clique decompositions of hypergraphs, namely triangle decom¬ 
positions of quasirandom graphs. 


1. Introduction 

When does a graph G have a triangle decomposition? (By this we mean 
a partition of its edge set into triangles.) There are two obvious necessary 
‘divisibility conditions’: the number of edges must be divisible by three, 
and the degree of any vertex must be even. We say that G is tridivisible if 
it satisfies these divisibility conditions. In 1847 Kirkman proved that any 
tridivisible complete graph has a triangle decomposition; equivalently, there 
is a Steiner Triple System on n vertices if n is 1 or 3 mod 6. In [5] we showed 
more generally that a tridivisible graph has a triangle decomposition if we 
assume a certain pseudorandomness condition. In fact, we proved a more 
general result on clique decompositions of simplicial complexes, which in 
particular proved the Existence Conjecture for combinatorial designs. 

One purpose of the current paper is to illustrate the new technique (Ran¬ 
domised Algebraic Construction) of [5] in the simplified setting of triangle 
decompositions; we will also prove a conjecture of Wilson [12] on the num¬ 
ber of Steiner Triple Systems. These results are proved in the next three 
sections, roughly following the method of |5| , but introducing some novelties 
in technique that lead to considerable simplifications in the case of triangle 
decompositions; the material here closely follows a lecture series that the 
author recently gave at the Israel Institute for Advanced Studies. In Section 
we sketch an argument of Bennett and Bohman [T] on the random greedy 
matching process and adapt the calculations to the version needed in this 
paper. We generalise from Steiner Triple Systems to designs in Section 
We conclude by noting that it remains open to obtain an asymptotic formula 
for the number of designs, or even just for the number of regular graphs. 

2. Triangle decompositions 

We start by stating our result that tridivisible pseudorandom graphs have 
triangle decompositions. The pseudorandomness condition is as follows. Let 
G be a graph on n vertices. The density of G is d{G) = \G\/{^. We say 
that G is c-typical if every vertex has (1 ± c)d{G)n neighbours and every 
pair of vertices have (1 ± c)d{G)‘^n common neighbours. (We write 6± c for 
any real between b — c and b + c.) 
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Theorem 2.1. There exists 0 < cq < 1 and uq € N so that if n > no 
and G is a c-typical tridivisible graph on n vertices with d{G) > n“^° and 
c < cod{Gy^^ then G has a triangle decomposition. 


Note that in Theorem 2.1 we allow the density to decay polynomially with 
n; this will be important for the application in the next subsection, but in 
many cases of interest one can consider d{G) and c to be fixed constants 
independent of n. One such consequence of Theorem |2.1| noted in [5] is that 
the standard random graph model G(n, 1/2) with high probability (whp) 
has a partial triangle decomposition that covers all but (1 + o(l))n/4 edges. 
Indeed, deleting a perfect matching on the set of vertices of odd degree 
and then at most two 4-cycles gives a graph satisfying the hypotheses of the 
theorem. This is asymptotically best possible, as whp there are (l-|-o(l))n/2 
vertices of odd degree, and any set of edge-disjoint triangles must leave at 
least one edge uncovered at each vertex of odd degree. 

We remark that our definition of typicality here is weaker than that used 
in [5j. In fact, for most of the paper we will assume the stronger version, 
then explain at the end how the proof can be modified to work with the 
current definition. We also make the (well-known) remark that typicality 
implies the standard regularity property (for appropriate constants) that 
appears in Szemeredi’s Regularity Lemma, but the converse is not true, as 
regularity allows individual vertices to behave badly, even to be isolated. 


2.1. The number of Steiner Triple Systems. Another purpose of our 
paper is to prove the following conjecture of Wilson [T2] on the number 
of Steiner Triple Systems on n vertices, i.e. triangle decompositions of the 
complete graph Kn, denote this by STS{n). 

Theorem 2.2. If n is 1 or 3 mod 6, then STS{n) = (n/e^-|-o(n))"'^/®. 


Note that Kn is tridivisible if and only if n is 1 or 3 mod 6, so STS{n) = 0 
for all other n. The upper bound in Theorem |2.2| was recently proved by 
Linial and Luria [8], who showed that STS{n) < (n/e^ -|- 0{y/n))'^^. Our 
lower bound will be STS{n) > [nj^ -|- 0(n^““))” for some small a > 0. 

Theorem |2.2| will follow quite easily from Theorem |2.1| and the semiran¬ 
dom method (nibble). It will be most convenient for us to apply the results 
of Bohman, Frieze and Lubetzky |2] on the triangle removal process (al¬ 
though we could make do with a simpler nibble argument, or the argument 
of Bennett and Bohman [Tj sketched in Section . We say that an event 
E holds with high probability (whp) if P(£') = 1 — for some c > 0 

as n —?• oo; note that when n is sufficiently large, by union bounds we can 
assume that any specified polynomial number of such events all occur. 

In the triangle removal process, we start with the complete graph K^, 
and at each step we delete the edges of a uniformly random triangle in the 
current graph. It is shown in [2] that whp the process persists until only 
q(j^3/2+o(i)) g(^ggg rgniain, but we will stop at edges (i.e. at the 

nearest multiple of 3 to this number) so that we can apply Theorem 2.1 We 


need the following additional facts from [2] about this stopped process: whp 
the final graph is n“^/^-typical, and when pn^/2 edges remain the number 
of choices for the deleted triangle is (I ± n“^/^)(pn)^/6. 




Proof of Theorem |2.2| , Consider the following procedure for constructing 
a Steiner Triple System on n vertices: run the tria ngle removal process 
until edges remain, then apply Theorem 2.1 (if its hypotheses are 


satisfied, which occurs in 1 — o(l) proportion of all instances of the process). 
Writing m for the number of steps and p{i) = 1 — GijnP, the logarithm of 
the number of choices in this procedure is 


Li = ^(log(p(f)^n^/6) ± 2n = (n^/6)(log(n^/6) — 3 ± n *), 

i=l 


since logp(i) = {l+0{n~^^ ^ log n))(n^/6) logp dp and Jq logp dp = 
— 1. Also, for any fixed Steiner Triple System, the logarithm of the number 
of times it is counted by this procedure is at most 


L 2 = ^ log(p(f)n^/6) = (n^/6)(log(n^/6) — 1 ± n 
^=1 


Therefore log(5T5(n)) > Li — L 2 = (n^/6)(log(n) — 2 ± 2n *), which 
implies the stated bound on STS{n). □ 


2.2. Strategy. The strategy of the proof of Theorem |2.1| is encapsulated 
by the following setup (we give motivation and discussion below). We say 
that J C G is c-bounded if |J(n)| < c\V{G)\ for every v G V{G), where 
J{v) = {u £ y{G) ■. uv £ J} is the neighbourhood of v in J. 

Setup 2.3. Suppose we have G* <£ G with a ‘template’ triangle decompo¬ 
sition T such that 

Nibble: G\G* contains a set N of edge-disjoint triangles with ‘leave’ 
L -.= {G\G*)\ UN that is ci-bounded. 

Cover: For any L <£ G \G* that is ci-bounded, there is a set M'’ of 
edge-disjoint triangles such that L = {G\G*)ri(L)M^) and the ‘spill’ 
S := G* n (UM'’) is C2-bounded, 

Hole: For any tridivisible S <£ G* that is C2-bounded, there are ‘outer’ 
and ‘inner’ sets M°, M* of edge-disjoint triangles in G* such that 
UM° is C3-bounded and {S, UM*) is a partition of UM°, 
Completion: Given L, M° and M* as above, there are sets Mi, 
M2, M3, M4 of edge-disjoint triangles in G* such that (L, UM2) is a 
partition of UMi, UM3 = UM4, M3 C T and M2 C M4. 


The key step is choosing T (which determines G*). We will use our 
method of Randomised Algebraic Construction, which takes a particularly 
simple form for triangle decompositions. To motivate the construction, sup¬ 
pose that V(G) is an abelian group, and consider the set S of triples xyz 
such that X -h y + z = 0. We note that S is a good ‘model’ for a triangle 
decomposition, as for any xy there is a unique z such that x -|- y = 0. 
However, we cannot simply take S, as not all such xyz are triangles of G; 
moreover, x, y, z may not even be pairwise distinct. 

The idea of the construction is that a suitable random subset of S can 
act as a template, which covers a constant fraction of G. Next we find an 
approximate decomposition of the rest of G by random greedy algorithms; 
this is accomplished by steps Nibble and Cover of Setup |2.3[ After these 
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steps, every edge of G has been covered once or twice, and the spill S is the 
set of edges that have been covered twice. Finally, we use local modifications 
built into the template to turn the approximate decomposition into an exact 
decomposition: this is accomplished by steps Hole and Completion of 
Setup |2.3[ 

To motivate Completion, we imagine first that we have Hole and also 
M° C T. Then we could delete M° and take M* instead, thus reducing by 
one the multiplicity of every edge in S, so that we have a triangle decompo¬ 
sition of G. However, specifying a triangle of T is very restrictive, as there 
are only order(n^) such triangles out of a total of order(n^) triangles in G. 
If we had chosen T uniformly at random it would be hopeless to obtain any 
useful configuration formed by triangles of T. However, the algebraic struc¬ 
ture implies that certain configurations of triangles are dense within a sparse 
configuration space (described by linear constraints). This forms the basis 
of a modification procedure that replaces M° and M* by other sets of 
triangles with the same properties, where Mi plays the role of M 2 

of M°, and each triangle / of M 2 can be embedded in a small subgraph that 
has one triangle decomposition (part of M 4 ) using / and another triangle 
decomposition (part of M 3 ) contained in T. 

It is not hard to see that G contains a triangle decomposition in Setup 
Indeed, we start by taking the sets N provided by Nibble and then the sets 
and S provided by Cover. Now we note that S = LIT + LiN + UM'^ — G 
is tridivisible, as any integer linear combination of tridivisible graphs is 
tridivisible. So we can apply Hole to obtain M° and M*. Then we can 
apply Completion to obtain Mi, M 2 , M 3 , M 4 . Finally, M = N LI Mi U 
(M 4 \ M 2 ) U (T \ M 3 ) is a triangle decomposition of G. Thus the remainder 


of the proof will be to show that we can achieve Setup 2.3 


2.3. Template. We choose the template as follows. 

Construction 2.4. Let a G N be such that 2““^ < |F(G)| < 2““^. Let 
TT : V{G) — )• F2a \ {0} be a uniformly random injection. Let 

T = {xyz G K^{G) : tt{x) + Ti{y) + vr(z) = 0} and G* = UT. 

To avoid cumbersome notation, we use xyz to denote either the vertex set 
{x, y, z} or the edge set {xy, xz, yz} of a triangle. The context determines 
which interpretation is intended, e.g. in Construction 2.4 the graph G* is 
the (disjoint) union of the edge-sets of the triangles in T. 

In this subsection we will show that whp the pair {G,G*) is ‘typical’ (in 
a precise sense defined below); this will allow us to implement the approxi¬ 
mate decomposition in steps Nibble and Cover. Moreover, we will show in 
Section 1^ that G* is ‘linearly typical’ (roughly speaking: we can count sub¬ 
graph extensions with linear constraints on the vertices); this will imply the 
existence of the local modifications used in steps Hole and Completion. 

We start with some notation and preliminary observations. Throughout 
we write n = |F(G')|. We identify G with its edge set E{G), so that |G| 
denotes the number of edges of G (rather than the number of vertices, as is 
used by some authors). We write [n] = {1,..., n}. We define 


7 = 2- 
4 


n, 




and note that 1/4 < 7 < 1/2. We observe that if x,y,z G F 2 a \ {0} and 
X + y + z = 0 then x,y, z are pairwise distinct. We note that +1 = —1 in 
F 2 a, so we can use + and — interchangeably in F 2 a-arithmetic. We consider 
F 2 a as a vector space over F 2 , and observe that any two nonzero elements 
span a subspace of dimension two. 

Next we introduce the stronger typicality assumption used in [5]. We say 
that G is {c,h)-typical if 

I G(a;)| = (1 ± |5|c)d(G)l‘^ln for any S C V{G) with I^I < h. 

Note that being c-typical is essentially the same as being (c, 2)-typical (up 
to a factor of 2 in c). For most of the paper we will assume that G is (c, 16)- 
typical; at the end we will explain how the proof can be modified to work 
with the weaker assumption that G is c-typical. 

Now we define the typicality condition for {G, G*) and show that it holds 
whp. Let G* be a subgraph of G. We say that (G, G*) is (c, h)-typical if 

I G*(a:)n G(x)| = ( 1 ± |5|c)fi(G*)l®*ld(G)l®l-l^*ln 

xes* x(^s\s* 

for any 5* C 5 C V{G) with IS"! < h. 

Lemma 2.5. whp d{G*) = {1 ± 3c)'yd{G)^ and {G,G*) is {()c, 16)-typical. 


The proof uses the following consequence of Azuma’s inequality. 

Definition 2.6. Let Sn be the symmetric group, / : 5^ —)• M and 6 > 0. 
We say that / is b-Lipschitz if for any a, a' G Sn such that cr = r o cj' for 
some transposition t £ Sn we have |/(ct) — f{cr')\ < b. 


Lemma 2.7. (see e.g. m) Suppose f : Sn ^ 
uniformly random and X = /(<t). Then 

F(|A:-EA| >t)< 2e-*"/2n62_ 


is b-Lipsehitz, a £ Sn is 


Proof of Lemma 2.5 We start by estimating E|G*| = ^ 


eeG‘ 


£ G* 


For any e = xy, given 7r(x) and 7r(7/), we have e G G* if and only if tt{z) = 
7 r(x) -|- TT{y) for some 2; such that xyz £ K'i{G). Since G is (c, 16)-typical, 
there are (1 ± 2c)d{G)‘^n choices for 2 . Each satisfies 7r(2;) = 7r(x) -|- 7r(y) 
with probability (2“ — 3)“^, so E|G*| = |G|(1 ± 2c)d{G)‘^n{2°‘ — 3)“^. We 
can view vr as u o vro, where ttq : V (G) —>■ F 2 <i \ {0} is any fixed injection and 
(T is a random permutation of F 2 a \ {0}. Any transposition of a affects |G*| 
by 0(n), so by Lemma 2.7 whp d{G*) = (1 ± 2.1c)7fi(G)^. 

Similarly, we consider any S* C S C V{G) with \S\ < 16, write Y = 
fixes* G*ix) n r\xes\s* and estimate E|y| = Y)yev{G) ^ 

any y £ nxesG(x), given 7r(y) and '7r(x) for all x £ S, we have y G T if and 
only if for all x £ S* there is xyzx £ ^^{G) such that it{zx) = 7r(x) -|- 7r(y). 
Since G is (c, 16)-typical, there are (1 ± |5|c)d(G)l‘^ln choices for y. By 
excluding 0(1) choices of y we can assume 7r(x) -|- 7r(y) 7^ 7r(x') for all 
X, x' G S. Then for each x £ S* there are (1 ± 2c)d{G)‘^n choices for Zx, and 
for any set of choices, with probability (1 -|- 0(l/n))2““l'^*l they all satisfy 
'^{zx) = 7r(x) -|- vr(y). This gives 

E|y| = 0(1) + (1 ± |S|c)(i(G)l^ln • ((1 ± 2c)d{G)^n)^^*^ • (1 + 0(l/n))2-“l^*l. 




Any transposition of a affects |y| by 0(1), so by Lemma |2.7| whp \Y\ = 
(1 ± {3\S\ + l)c)d(G)l^l(7fi(G)2)l5*ln = (1 ± 6|5|c)d(G*)l^ld^^l-l^‘ln. □ 

Since d{G*) = (1 ± 3c)'^d{GY and 1/4 < 7 < 1/2 we have 0.24d(G)^ < 
d{G*) < 0.51d(G) for small c. Also, as (G, G*) is ( 6 c, 16)-typical we can 
deduce that G \ G* is 50c-typical. Indeed, for any v G V (G) we have 

|(G \ G*)(u)| = (1 ± c)d{G)n - (1 ± 6c)d{G*)n 
= {d{G) - d{G*))n ± 6c{d{G) + d{G*))n = (1 ± 20c)d(G \ G*)n. 

Furthermore, for any u,v G V{G) we estimate |(G \ G*)(u) n (G \ G*)(u)| as 

|G(u) n G(u)| - |G*(u) n G(u)| - |G(u) n G*(u)| + |G*(u) n G*(u)| 

= (1 ± 2c)d{Gfn - 2(1 ± 12c)fi(G)d(G*)n + (1 ± 12c)d(G*)2n 
= (d(G) - d{G*)fn ± 12c(d(G) + d{G*)fn = (1 ± 50c)d(G \ G*fn. 

Applying the following theorem, we deduce Nibble with ci = (50c) 

Theorem 2.8. There are bo > 0 and no Gfi so that if n > no, n~^'^ < b < 
bo and G is a b-typical graph on n vertices with d{G) > b, then there is a 
set N of edge-disjoint triangles in G such that L = G \ UN is b^/‘^-bounded. 


We remark that the parameters in Theorem 2.8 are not very sharp: we 
have just fixed some convenient values that suffice for our purposes. Similar 
results are well-known, but we are not aware of any reference that implies 
the theorem as stated, so we will sketch a proof in Section 

For convenient reference, we give here the values of some other parameters 
that will be used below: 


C2 = 102cid(G)-®, C3 = 1020c2d(G)-5° 

C 4 = 10 ^°C 3 (i(G)“^°° and C 5 = 10 ^°C 4 d(G)“^®°. 


The tightest constraint on c that will be required in our calculations is 
IOOC 5 = 10®^(50c)^/^d(G)“^^® < 10“®(i(G)^®°; this holds for small cq if c < 
cod{G)^^^^. (This is the bound we need if G is (c, 16)-typical, but if G is 


c-typical we need the stronger bound in Theorem 2.1 


2.4. Cover. Consider the following random greedy algorithm. Let L = {cj : 
i G [t]} (with edges ordered arbitrarily). Let M'^ = {Tj : i G [t]} be triangles 
such that Ti consists of e, and two edges of G*, and is chosen uniformly 
at random from all such triangles that are edge-disjoint from all previous 
choices; if there is no available choice for Tj then the algorithm aborts. 

To analyse the algorithm we require a concentration inequality. We 
say that a random variable Y is {p,G)-dominated, if there are constants 
pi,..., Pm with YllLi hi < we can write Y = such that 

\Yi\ <G for all i, and conditional on any given values of Yj for j < i we have 
E|li| < Pi. The following lemma follows easily from Freedman’s inequality 
|3] (see [5l Lemma 2.7]). 

Lemma 2.9. IfY is {p, G)-dominated then P(|T| > 2p) < 
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Sometimes we will use a modified inequality with 2 replaced by 1 + c. 
We also note that if the Yi are independent (not necessarily identically dis¬ 
tributed) indicator variables we recover a version of the Chernoff bound for 
(pseudo)binomial variables (where better concentration is known). For the 
following lemma, we recall that L is ci-bounded, where ci = (50c)and 
that C 2 = 10^ci(i(G')“®. 

Lemma 2.10. whp the algorithm to choose does not abort, and S := 
G* n (UM'^) is C 2 -bounded. 


Proof. For i G [t] we let Bi be the bad event that Si := G* D (Uj<iTj) is 
not C 2 -bounded. We define a stopping time r be the smallest i for which Bi 
holds or the algorithm aborts, or oo if there is no such i. It suffices to show 
whp r = oo. 

We fix to £ [t] and bound P(r = to) as follows. For any i < to, since 
Bi does not hold. Si is C 2 -bounded. Writing Cj = Viv[, we can bound the 
number of excluded choices for Tj by C 2 n < \G*{vi) n G*{v[)\/ 2 , so at most 
one half of the triangles on e, are excluded. 

Next we fix e = vv' G G* , and estimate Cg := where 

P' denotes the conditional probability given the choices made before step i. 
We compare Cg to the expected number of times that e would be covered if 
we chose all triangles independently. To be precise, we let 

i^g:= J]P(eCr'), 

i<to 


where each T- is a uniform random triangle consisting of Cj and two edges 
of G*, and (T/ : i G [t]) are independent. By the bound on excluded choices, 
P'(e C Ti) < 2P(e C T/), so rg < 2Flg. 

The ith summand in Ef. is only nonzero when Ci Ci e ^ As L is Ci- 
bounded, the number of such i is at most \L{v)\ + \L{v')\ < 2cin. Also, for 
each i such that U e spans a triangle, we have 

p(e C T') = \G*{vi) n G*{v[)\-^ < 2d{G*)-‘^n-^. 

Therefore < 4cid(G*)“^ < C 2 / 4 . 

Finally, fix u G y{G) and consider X = |StQ(u)| = Ylii<to^i^ where 
= UveeeG* leCTi- We have \Xi\ <2 and 


J^E'(Xi) = ^ r{eCTi)= ^ rg<C2n/2. 

i<to vGeGG* v£e£G* 


By Lemma 


2.9 


we have P(A > C 2 n) < 2e Taking a union bound 

over i <to <t, whp |S'(u)| < C2n, i.e. S is C2-bounded and r = 00. □ 


Below we will require several more random greedy algorithms similar to 
that above. One could formulate an abstract general lemma to cover all cases 
(see O Lemma 4.11]), but here we will prefer the more intuitive approach 
of identifying the key principles of the proof, so that it will be clear how it 
may be adapted to future instances. For a general random greedy algorithm, 
we identify some desired boundedness conclusion, then at each step of the 
algorithm, assuming that boundedness has not failed, we show that at most 
one half (say) of the choices of the required configuration have been excluded. 
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Then for each edge e in the underlying graph H we estimate the expected 
number of times that e would be covered if we chose all conhgurations 
independently. If Eg < 6/4 and the conhgurations have constant size (not 
depending on n) then the graph of all covered edges is whp 6-bounded. 

We record some estimates that are useful for such arguments. Suppose H 
is a small hxed graph (|^f| < 500 say), E C V{H) and (p is an embedding of 
H[F] in G*. We call E = (p, F, H) an extension. Let Xe{G*) be the number 
of embeddings (jf of H in G* that restrict to p on F. We suppose that E is 
Its-degenerate, meaning that we can construct the embedding one vertex at 
a time, so that at each step we add a vertex adjacent to at most 16 existing 
vertices. As {G,G*) is (6c, 16)-typical, when we add a vertex adjacent to 
t < 16 existing vertices, there are (1 ± 6tc)d{G*Yn choices. Multiplying 
these estimates, we obtain the following estimate for Xe{G*). 

Lemma 2.11. Suppose E = {(p,F,H) is a 16-degenerate extension with 
\H\ < 500. Then 

XEiG*) = (1 ± 

Now suppose that we wish to exclude embeddings p* that use some edge 
in J, which is c-bounded. Fix e £ El \ E[[F] and consider the embeddings 
(p* with p*{e) £ J. If e n T 0 there are at most cn choices for the 
embedding of e then at most choices for the remainder of p*. 

If ePlT = 0 there are at most cre^ choices for the embedding of e then at most 
jpy choices for the remainder of p*. Thus at most \H\cn^^ 
choices of p* are excluded, which is a negligible fraction of Xe{G*). 

3. Integral relaxations 

In this section we establish Hole. Our first step is to consider an integral 
relaxation, in the following sense. Instead of thinking of (S', UM*) as a 
partition of UM°, we think of S as a weighted sum of edge sets of triangles, 
where triangles in M° have weight 1 and triangles in M* have weight —1. We 
can express this by the equation = S, where $ is the corresponding ±1- 
vector indexed by triangles, and A is the inclusion matrix of triangles against 
edges, i.e. Af^, = Igc/ for any edge e and triangle /. It is straightforward 
to show that this equation has a solution if we allow to have any integer 
weights on triangles (see [iiiiaE] for more general results). 

It will be more convenient to work with linear maps rather than matrices. 
For any graph H we define Z-linear boundary/shadow maps dj : —>■ 

for i > j > 0 by dj{e) = ^ (p for e G Ki{H), i.e. for J G 
and / G Kj{H) we define dj{J)f = YhfCe&KdH) example, if J G Z^ 

then di{J) £ is defined by cli(J)u = X- 

It will also be notationally convenient to identify vectors with (gener¬ 
alised) sets. It is standard to identify v £ {0,1}^ with the set {x £ X : 
Vx = 1}. Similarly, we can identify v £ with the multiset in X in which 
X has multiplicity Vx (for our purposes 0 G N). We also apply similar nota¬ 
tion and terminology as for multisets to vectors v £ TX (‘intsets’). Here our 
convention is that ‘for each x £ E means that x is considered \vx\ times in 
any statement or algorithm, and has a sign attached to it (the same as that 
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of Vx)', we also refer to x as a ‘signed element’ of v. For v G we write 
V = v~^ — v~, where Vx = max{r;a;,0} and = max{—n 3 ;, 0 } for x £ X. 
Given J £ and v £ V{G), we define J{v) £ by J{v)u = Iuv&gJuv 

Then we can extend the definition of bonndedness to multigraphs: J is 
c-bounded if |J(n)| < cn for every v £ V{G). 

With this notation, our integral relaxation of Hole is expressed by the 
following lemma (in which Kn denotes the complete graph on V{G))] for 
Hole we will need the additional properties that $(/) = 0 for any / £ 
K^{Kn) \ K^{G*), and $(/) £ {0,1, —1} for all / G K^i^G*), as then we can 
write = M° — Mb 

Lemma 3.1. There is ^ £ yjith = s such that 82 ^^ is IOOC 2 - 

bounded. 

Proof. We will construct <1) = <l)o + ‘I’l + ‘ 1*2 such that = 8 — 82 ^ 0 , 
— 82 ^ 1 , — 82^2 satisfy 9* J* = 0 for i = 0,1, 2. Recalling 

that S is tridivisible, each J* will be tridivisible, in the ‘intgraph’ sense: i.e. 

Jg is divisible by 3 and Ylu "^uv is divisible by 2 for all v. 

Step 0: For d>o, we choose jS'l/S independent uniformly random triangles 
in Kn'-, then = S — 82^0 satisfies 8 qJ^ = 0 . For each vertex v, the number 
of these triangles containing v is binomial with mean |S'|/n < C 2 n/ 2 , so by 
the Chernoff bound whp ^ 2^0 is l.lc 2 -bounded. 

Step 1: We let J* = 9iJ°, so 80 J* = 25oJ° = 0, i.e. | J*+| = |J*-|. Note 
for all X £ V{G) that J* is even, as is tridivisible, and | J*| < l.lc 2 n. We 
fix an arbitrary sequence ((x^,x^) : i £ [|J*'’“|/2]) so that each x £ V{G) 
occurs Jx^l2 times as some xf and J%~l‘l times as some x~. For each i 
we choose aibi C V{G) \ {xf,x~} independently uniformly at random, and 
let $1 = ^i^[\j*+\/ 2 ]i{xta-ibi} - {x~aibi})', then = jO - 02 ^^! satisfies 
81 = 0 . 

We claim that whp 82 ^^ are 8 c 2 -bounded. To see this, we first fix any 
e G Kn and estimate the expected contributions to e from each step i, 
according to whether e contains xf, x~, or neither. Each endpoint of e 
occurs at most 0 . 6 c 2 n times as xf, and for such i we cover e with probability 
2 /(n — 2 ), so the expected contribution to { 82 ^f)e from all such i is at most 
2 . 5 c 2 . At any other step, we cover e with probability ("' 2 ^) ’ total 

expected contribution to { 82 ^f)e from these steps is at most I.IC 2 . Now, for 
each vertex v, summing over its incident edges, 102 ^^^(n)| are both ( 4 c 2 n, 1 )- 
dominated, so the claim holds by Lemma |2.9[ 

Step 2: We start by fixing an arbitrary expression = Ylc&Co ''^here 
each C is a closed walk in G* with edge weights alternating between 1 and 
— 1, and there are no cancellations, i.e. every edge appears in the sum only 
with weight 1 or only with weight —1. As is well-known, such an expression 
may be found by a greedy algorithm: each C can be obtained by following 
an arbitrary alternating walk on the signed elements of until we return 
to our starting point using an edge with the opposite sign to that of the first 
edge, whereupon we add —G to and repeat the procedure. (We note that 
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Figure 1. Decomposing even signed cycles. 


this argument leads to a convenient shortcut for triangle decompositions, 
but does not generalise to hypergraph decompositions.) 

Next we express each C G Cq as a sum of signed four-cycles in the complete 
graph Kn on V(G), where we write each closed walk of length 2 m as a chain 
of m — 1 signed four-cycles, using the identity (see Figure 1) 

m—1 

- {xi+ivi+i} + {yi+iVi} - {ViXi}) 

i=l 

m—1 m—1 

= {xiyi] + (-l)™{xm 2 /m} + '^{-iy{xiXi+i} -h ^(-l)*{yjyi+l}. 

i=l i=l 

This identity can be used as is if Xi ^ yi for i G [m]. For each i such that 
Xi = yi, we note that 1 < i < m, Xj_i ^ Xj+i / yi+i, and Xj+i / yi-i, 
so we can replace the four-cycles for summands i — 1 and i by 

{{xi-ixi} - {xiXi+i] -t- {xi+iyi-i] - {yi-ixi-i}, and 

{-iy{{xi+iyi-i} - {yi-iyi} + {ym+i} - {yi+iXi+i}). 


Thus we can write = YZceC where each summand is a signed four¬ 
cycle in Kn- Furthermore, the above construction has the property that for 
each V G V{G) and w G {—1,1} we use at most 3| < 24c2n edges at 

V with weight w. 

For each G = {ab} — {bc}+{cd} — {da} G C we choose x G V{G)\{a, b, c, d} 
independently uniformly at random, and add {xab} — {xbc} + {xcd} — {xda} 
to <h 2 ; then 82^2 = ~ ^ denote the multigraph formed 

by summing {xa, xc, xd} over all such G. For any e G Kn, at most 
48c2n elements of C can contribute to Fg, so EFe < 49c2n. Then for any 
V, summing over its incident edges, |r(u)| is (49c2n,4)-dominated, so by 
Lemma 2.9 (modified) whp T is 50c2-bounded. Defining <1> = <l>o -|- <hi -|- <^ 2 , 
we have 82 ^ = S and 82 ^~^ is 100c2-bounded. □ 


To obtain Hole, we will modify 4> using the following ‘octahedral’ con¬ 
figurations (see Figure 2). Consider a copy of K 2 ^ 2 , 2 , the complete tripartite 
graph with 2 points in each part, with parts {(j, 0), (j, 1)} for j G [3]. We 
denote its triangles by {fx : x G {0,1}^}, where fx = {{j,Xj) : j G [3]}. The 
sign of fx is s{fx) = (—1)^’”. Thus each edge is in one triangle of each sign. 
Dehning D = i}^ ^{fx){fx} £ 'Z^ 3 (K 2 , 2 , 2 ) ^ ggg 82 ^ = 0. This 

gives a method to eliminate any signed triangle / from without altering 
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( 3 , 0 ) 


( 1 , 0 ) 



( 2 , 0 ) 


"( 3 , 1 ) 

Figure 2 . An octahedron with signed triangles. 


^ 2 ^: we add some copy of Q with the opposite sign to / in which (say) 
/ooo = /) thus replacing / by seven other signed triangles that have the 
same total 2-shadow. Similarly (and more importantly), we can eliminate 
any pair of triangles /, /' that have opposite sign and share an edge e, replac¬ 
ing /, f' by six other signed triangles that have the same total 2-shadow and 
do not use e. We apply this method in the following two-phase algorithm. 

Octahedral Elimination Algorithm (Phase I). We eliminate all triangles in 
<h, according to a random greedy algorithm, where in each step we consider 
some original signed element / of and choose an octahedral configuration 
12/ to replace /. We refer to edges of 12/ not in / as new edges, and choose 
12/ uniformly at random subject to the new edges belonging to G* and being 
disjoint from 82 ^^ and all new edges from previous steps. 

Let denote the result of Phase I (if it does not abort). Then 82 ^' = 
82 ^ = S, and we can write 82 ^'~^ = 82 ^~^ + F, where F is the graph of new 
edges, and every signed element of contains at most one edge of 82 ^'^■ 

Octahedral Elimination Algorithm (Phase II). We replace all signed edges 
apart from those in S and F. To do this, we fix a sequence S of pairs of 
signed elements of so that (i) for each //' G 5, there is some e G 02 ^^ 
such that / and f both contain e, and / and /' have opposite signs, and 
(ii) the multiset consisting of all e as in (i) is 82 ^~ ■ Now we eliminate each 
//' G S, according to a random greedy algorithm, by subtracting some copy 
12 /// of 12 with /ooo = / and /ooi = /^ or vice versa, depending on the 
signs. We refer to edges of 12/// not in / or f as new edges, and choose 
12/// uniformly at random subject to the new edges belonging to G* and 
being distinct from 82 ^^ U F and all new edges from previous steps. 

Let T denote the result of this algorithm (if it does not abort) and F' the 
graph of new edges for Phase II. Then 82 ^ = S and 82 ^~ = F U F' C G*. 
This implies '!/{/) = 0 for any / G KsiKn) \ K^iG*), and T(/) G {0,1, -1} 
for all / G K^{G*), so 'h = M° — M*, where M° and M* are as in Hole, 
once we have verified the boundedness condition. 
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Lemma 3.2. whp the Octahedral Elimination Algorithm produces M° and 
M* as in Hole. 


Proof. We first show that whp F is c^-bounded, where c '2 = 10 ^C 2 d(G'*)“® 
The proof follows the discussion after the proof of Lemma 2.10[ where a 
configuration for / consists of the new edges of some Olf. By Lemma 2.11 


at each step, the number of choices of Of with all new edges belonging to G 
(with no excluded configurations) is (1 ± 60c)d{G*)^n^. Assuming that the 
graph of previous new edges is C 2 -bounded, as ( 92 ‘h''' is 100 c 2 -bounded, the 
number of excluded conhgurations is at most lOc^n^, which is less than half 
of the total. Next, for each e € G*, we consider separately the contributions 
to Ef,, according to whether e intersects / in 0 or 1 vertex (there is no contri¬ 
bution to new edges from triangles containing e). There are at most 600c2n 
signed elements of that intersect e in 1 vertex. For each of these, a random 
configuration covers e with probability at most 3n^/(l — 60c)d{G*)^n 
the total contribution to Ee from such elements is at most 200002 ^( 6 *' 
Also, <1> has at most 100 c 2 n^ signed elements, and for each one that is dis¬ 
joint from e the contribution to Eg is at most 6 n/(l — 60c)d{G*)^n^, so the 
total contribution from such elements is at most 1000 c 2 fi(G*)“®. We obtain 
Ee < 3000c2fi(G*)“®, which implies the claimed bound on T. 

Next we claim that whp T' is C 2 -bounded, where C 2 = 20 c2d{G*)~'^. The 
argument is very similar to that given for T. Now a conhguration for ff 
consists of the new edges of some 12 //'. 


so 
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By Lemma 2.11 


at each step, 

the number of choices of Off/ with all new edges belonging to G* (with no 
excluded configurations) is (1 ± 50c)d(G*)^n^. Assuming that the graph of 
previous new edges is C 2 -bounded, as 82 ^^ U T is 2c2-bounded, the number 
of excluded conhgurations is at most 10 c 2 n^, which is less than half of the 
total. Next, for each e G G*, we consider separately the contributions to 
Ee according to whether e intersects / U /Mn 0 or 1 vertex (there is no 
contribution to new edges if e C / U /'). 

First we consider those ff' G S that intersect e in 1 vertex x. There are 
two choices for x £ e. If x G / H /' then there are at most 200 c 2 re choices 
for / n /' G 82 ^^ U 82 ^~, which determines / and /'. If {x} = f \ f 
then there are at most |F(x)| < C 2 n choices for /, and so /'. The same 
bound applies if {x} = f' \ f, so there are at most 5 c 2 n such //'. Each 
contributes at most 2 n/(l — 50c)d(G*)^n^ to Ee, so the total contribution 
from such ff' is at most llc' 2 d{G*)~'^. Also, |5| = | 92 ‘^“| < 100 c 2 n^, and 
for each ff' G S with e H (/ U /') =0 the contribution to Ee is at most 
2/(1 — 50c)d{G*yn'^, so the total contribution from such elements is at most 
300c2d(G*)“^. We obtain Ee < 12 c' 2 d(G*)~'^, which implies the claimed 


bound on F'. Recalling that d{G*) > 0.24d{G)'^ and C3 = 10^'^C2d(G) we 
see that UM° = 82 ^^^^ = 5 U F U F' is C3-bounded, so we have the required 
properties for Hole. □ 


4. Completion 

For Completion, we divide the analysis into two parts. Firstly, we will 
determine what conditions on Mi and M 2 enable us to hnd M 3 and M 4 . 
Secondly, we will show that the sets M^, M° and M* from Cover and Hole 
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can be modified to give Mi and M 2 satisfying the required conditions. For 
convenient notation we suppress the embedding tt : V{G) —>■ F 2 a whenever 
we do not need to refer to it, instead thinking of V{G) as a subset of F 20 . 


4.1. Shuffles. Suppose we have a set M 2 of edge-disjoint triangles in G*, 
and we want to find sets M3 and M4 of edge-disjoint triangles in G* such that 
UM3 = UM4, M3 C T and M2 C M4. Our basic building blocks (‘shuffles’) 
will be edge-disjoint subgraphs of G*, each having two different triangle 
decompositions, one only using triangles in T, and the other including any 
specified triangle of M2. Then the unions over all blocks of the two triangle 
decompositions will give M3 and M4 as required. 

We define the shuffles as follows. Fix x = {xi,X2,X3) £ F^a and t = 
{ti,t2) £ F^a such that {xi,X2,X3,ti,t2} is linearly independent over F2. 
Let (x) be the subspace of F2a generated by {xi,X2,X3}. The xt-shuffle Sxt 
is the complete tripartite graph with parts ti + {x) = {ti+y : y £ (x)}, i £ [3], 
where := ti -£ t2- If Sxt ^ G* then it has a triangle decomposition M^xt 
only using triangles in T: take all triangles 2/12/22/3 where each yt £ U + (x) 
and 2/1 + 2/2 + 2/3 = 0 - We define another triangle decomposition M4xt of Sxt 
by translating each triangle of M^xt by (xi,X2,X3), i.e. M4xt consists of all 
triangles 2/12/22/3 where each yt £ti + (x) and xi -|- X2 -|- X3 -|- 2/1 + 2/2 + 2/3 = 0 . 

To construct M3 and M4, we choose shuffles according to a random greedy 
algorithm, where in each step we consider some Z 1 Z 2 Z 3 £ M 2 , and choose 
some shuffle Sxt Q G* such that Zi = ti + xi for all i £ [3]. We will see 


in Lemma 4.1 that the Randomised Algebraic Construction is whp such 
that there are many choices for such a shuffle. This is the most important 
property of the construction, and it would not hold if we had chosen the 
template to be a uniformly random set of edge-disjoint triangles; in fact the 
expected number of shuffles (or any ‘shuffle-like’ configuration) would be 
0 ( 1 ). First we identify a property that we need for triangles in M 2 so that 
the required shuffles exist and can be chosen to be edge-disjoint. We say 
that Z 1 Z 2 ZZ is octahedral li zi + Z 2 + z^, ^ and there is a copy K' of 1^2, 2,2 
in G such that 7 r{K') has parts {zi, Z2 + Z3}, {z2, zi -t- 2:3} and {23, zi + Z2}; 
we call K' the associated octahedron of 2122 • 23 - We assume 

(PI) all triangles in M 2 are octahedral, with edge-disjoint associated oc- 
tahedra. 


Lemma 4.1. Under the random choice ofn used in the definition ofT, whp 
for any octahedral ziZ 2 Z^ there are (l± 200 c)(i(G)^®° 7 ^® 2 ^“ shuffles Sxt U G* 
such that ti Xi = Zi for f e [3]. 

Proof. We can write the number of such shuffles as a sum of indicator 
variables X = Yl^E(K,e,x,t)y where the sum ranges over all {K,i,x,t) such 
that K is a copy of ATs.s.s in G containing the associated octahedron K' of 
ziZ 2 Z^, f is a bijective labelling of each part of K by F^, we let E{K,i,x,t) 
be the event that tt{w) = ti + £(w) ■ x for all i £ [3] and w in the ith part 
of K, and we assume I is consistent with K', in that £(Tr~^ (zi)) = Ci and 
£(Tr~^{zi + Zj)) = ei + Cj for {i,j} C [3]. 

As G is (c, 16)-typical, there are (1 ± 181c)(i(G)^®°n^® choices of {K,£). 
There are 2^“ — 0{n) choices of t, which determines x given z, as only 0{n) 
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choices of t are excluded by the condition that {xi,X2,X3,ti,t2} is linearly 
independent over F 2 : there are 0 ( 1 ) possible linear relations between them, 
and each such relation is linearly independent or contradictory to the system 
ti + Xi = Zi for i G [3] (as zi + Z2 + Z'i 7 ^ 0), so is satisfied by at most 2“ 
choices of t. Given {K, i, x, t), conditional on 7r|x', we have F{E{K, i, x, t)) = 
(1 + 0(l/n))2-i^“. Therefore EX = {1 ± . 

Also, any transposition r of vr affects X by at most 100 -2“. To see 
this, we estimate the number of shuffles containing Z1Z2Z3 and any fixed 
V € ¥20. \ {zi,Z2,Z3,zi + Z2,ZI + 2:3, 2^2 + -^s}- Consider any j G [3], b G 
FI \ {sj, (1,1,1) — Bj}, and the equations ti + b ■ x = v and ti + Xi = Zi for 
i G [3] in {t,x). We have four linearly independent constraints, so there are 
at most 2“ solutions. Including multiplicative factors for i, b and r gives the 
required bound. Now by Lemma 2.7 whp A = (1 ± 200 c)(i(G)^®° 7 ^^ 2 ^“. □ 


4.2. Linear extensions. We digress to note a more general estimate for 

future reference. Suppose 77 is a graph, y = [vi ■ i G [g]) are variables, and 
for all V G V{H) we have distinct linear forms L^{y) = + Yli&Sv 

some Cv G F 2 a and C [g]. We call E = (L, H) a linear extension with base 
E = {v G V{H) : Sy = 0}. Let Xe{G*) be the number of L-emheddings 
of 77, i.e. embeddings of 77 in G* such that for some y G E^a we have 
4>{v) = Ly{y) for all v G V{H). The above argument (see also [5l Lemma 
5.15]) gives the following formula analogous to that obtained for shuffles. 

Lemma 4.2. Let E = (L, 77) be a 16-degenerate linear extension with \ H\ < 
500. Suppose 

• 77 has a triangle decomposition M such that for each xyz G M we 
have Lx T Ly = L^^ 

• The incidence matrix of{Sy : v G 17(77)} has full column rank g > 1. 

Then 

Xe{G*) = (1 ± l.l|77|c)d(G)l^\^[-^]l7'^^'^^^^'2^“- 

4.3. Shuffle algorithm. Recalling our general framework for random greedy 
algorithms, we want to show that, of the potential shuffles Sxt with U-Gxi = 
Zi for i G [3], at most half are excluded due to sharing an edge with a previous 
shuffle, assuming some boundedness condition on the graph T of new edges 
from previous shuffles. We classify the potential restrictions according to 
the label of the shuffle edge involved, which is specified by some {j, k} C [3] 
and bj, b^ G F^ such that bj ^ {(cj, (1,1,1) — ej) or bk ^ {(cfc, (1,1,1) — e^) 
(here we do not consider edges of the associated octahedra: these are al¬ 
ready determined, and edge-disjoint by (PI).) For any VjVf- G G*, the 
shuffles excluded because of mapping the given labelled shuffle edge to VjVk 
are given by the (x, t)-solutions of the system S of equations tj + bj ■ x = Vj, 
tk + bk- X = Vk and U + Xi = Zi for z G [3]. There may be 0, 1 or 2“ solutions. 
We can ignore the case of 0 solutions, as it does not exclude anything. For 
the cases with 1 solution, we can bound the number of excluded choices by 
the number of edges covered by all shuffles, which is 192|M2|. 

It remains to consider the case that S has 2“ solutions, which occurs 
when one of the equations is redundant, due to being a linear combination 
of the other equations. There are a constant number of linear combinations, 
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and each constrains {vj,Vk) to lie on a line, as may be seen from general 
considerations of linear algebra, or simply by enumerating the possibilities: 
wlog tk + bk ■ X = Vk IS redundant, due to 

(i) bk = Ck and Vk = Zk, 

(ii) bk = (1,1,1) - Cfc and Vk = zi + Z 2 + zs - Zk, 

(hi) bj + bk = ej + ek and Vj + Vk = zj + Zk, 

(iv) bj + bk = Ci and vj + Vk = Zi, where [3] = {i,j, k}. 

In cases (i) and (ii) where Vk is fixed, assuming that F is cs-bounded, there 
are at most c^n choices for Vj such that VjVk G F. In cases (hi) and (iv) we 
need an additional boundedness condition: 

We say that F is linearly c^-bounded if F is cs-bounded and also contains 
at most 0 ^ 2 °- edges from any line of the form {(xi + ^, X 2 + /i) : ^ G F 2 a}. 

We also need similar conditions so that we can avoid the associated oc- 
tahedra; writing A for the union of all associated octahedra of triangles in 
M 2 , we will ensure that 

(P2) A is linearly C 4 -bounded. 

Then the total number of excluded shuffles is at most 192(|M2| + (04 + 
€ 5 ) 2 ^“) < 200 c 52 ^“, which is less than half of the total. 

Next we fix e G G* and estimate Ee- To do so, we hx bj, bk as above, write 
e = VjVk and estimate the sum over Z 1 Z 2 Z 3 G M 2 of the probability p that a 
random shuffle Sxt with U + Xi = Zi for i G [3] satishes tj + bj ■ x = Vj and 
tk + bk ■ X = Vk- For fixed Z 1 Z 2 Z 3 , if the system S as above has N solutions 
then p = N/{1 ± 200 c)d(G)^®° 7 ^® 2 ^“. When N = 1 the total contribution 
is at most |M 2|/(1 - 200c)d(G)i®V®22“ < . If A = 2“ 

then {zi, Z 2 , Z 3 ) is constrained to he in a certain plane (this can be seen by 
linear algebra, or by considering each possibiity as above: e.g. in case (ih) 
the plane is Vj + Vk = Zj + Zk)- Thus we see the hnal property that we need 
from M 2 : 

(P3) M 2 contains at most 042 “ elements Z 1 Z 2 Z 3 from any basic plane of 
the form b ■ z = v where 6 G F 2 \ {0}. 

(Note that by (PI) we can assume n 7 ^ 0 in (P3).) Then the total contribu¬ 
tion is at most 042 “ • 2“/(l — 200 c)d(G)^®° 7 ^® 2 ^“. Summing over {j,k}, bj 
and bk, we can estimate Eg < 250c4(i(G)“^®°7“^^ = C5/4. Applying Lemma 


assumptions on F used above do not fail (linear boundedness follows in the 
same way as boundedness), and so the algorithm does not abort. This com¬ 
pletes the analysis of the first part of Completion: given Mi and M 2 as 
in Completion, under the conditions (P1-P3) on M 2 , we can hnd M 3 and 

M 4 as in Completion. 

4.4. Octahedral Elimination Algorithm. To complete the proof of Com¬ 
pletion, and so of the theorems, it remains to show that we can find Mi 
and M 2 satisfying the conditions (P1-P3). We apply a similar two-phase 
algorithm to that used in Hole. 

Phase 1. We start with = M'^ -|- M* — M°, so 82 ^ = L, d 2 ^~^ = 
U{M^ U M*), d 2 ^~ = UM°. Next we eliminate all triangles in <I> according 
to a random greedy algorithm, where in each step we consider some original 
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2.9 as in the proof of Lemma |2.10[ we deduce that whp the boundedness 





signed element / of and choose an octahedral conhguration to replace 
/. We say that a triangle f' of ilf is far if |/' n /| < 1, and that is valid 
if (i) none of its triangles are template triangles, with the possible exception 
of /, and (ii) all of its far triangles are octahedral, and their associated 
octahedra share edges only in Qf, in which case we denote their union by 
the extended configuration fljr. We say that an edge of not in / is new, 
and choose a valid 0/ uniformly at random subject to the new edges being 
distinct from all new edges from previous steps. 

Let denote the result of Phase I (if it does not abort). We have 82 ^' = 
02 $ = L, and writing P for the graph of new edges, every signed element 
of is either a far triangle consisting of three edges of P, or is not far and 
consists of two edges of P and one edge of d 2 ^~^■ 

Phase II. Now we will eliminate all triangles of apart from those that 
contain an edge of L or were far in the previous modification procedure. We 
partition all such triangles into a sequence S of pairs of signed elements of 
so that for each //' G S, there is some e G 82 ^'^ such that / and /' both 
contain e, and / and f have opposite signs. We eliminate each //' G S, 
according to a random greedy algorithm, by subtracting some copy Hff 
of n with /ooo = / and /ooi = /^ or vice versa, depending on the signs. 
Now we say that f^//' is valid if all of its triangles apart from / and f 
are octahedral, and their associated octahedra share edges only in in 

which case we denote their union by the extended configuration We 

refer to edges of not in / or /' as new edges, and choose a valid II.ff 
uniformly at random subject to the new edges being distinct from P and all 
new edges from previous steps. 

Let 'h denote the result of this algorithm (if it does not abort) and P' the 
graph of new edges for Phase II. Since 82 ^ = 82 ^ = L, defining Mi = 
and M 2 = 'L”, we see that UM 2 = P U P^ and UMi = L U P U P', so 
(L,UM 2 ) is a partition of UMi. The following lemma completes the proof 
of Completion, and so of the theorems, under the assumption that G is 
(c, 16)-typical. 

Lemma 4.3. whp M 2 satisfies (PI), (P2) and (P3). 


Proof. To analyse Phase I, we first estimate the number of choices for an 
extended configuration on a triangle /. This can be described by the linear 
extension where is as above, we have variables 2 ; = (zi,Z 2 ,Z 3 ), 

which we also use to label the vertices of Hf \ f, we define = x for all 
X € Hf, and define Lx for all other x as required for the far triangles in 
IIj to be octahedral, i.e. in the associated octahedron for a triangle abc, the 
linear forms on the two vertices in each of the three parts are {La, Lh + Lc}, 
{Lb, Lc + La} and {Lc, La + Lb}. By Lemma 4.2 whp G* is such that for any 
triangle / in <I>, there are (1 ±60c)d(G)'^^7^®2^“ valid choices of Qj. Here we 
also use the fact that for any triangle abc of Hf other than / there are only 
2 ^“ solutions to La{z) + Lb{z) + Lc{z) = 0. The precise exponents of d{G) 
and 7 (which are not important for the argument) may be easily calculated 
from the observation that adding an octahedron to a triangle adds 3 new 
vertices and 9 new edges, and 11^ is the composition of 5 such extensions. 
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Next we claim that whp the graph T of new edges is linearly c^-bounded, 
where C 3 = 400c3d(G)“^^7“^^. We assume this bound on the current graph 
of new edges and estimate how many configurations are excluded. Consider 
any edge uu' of the extended configuration. Suppose first that uu' n / = 0. 
If Lu{y) + Lu'{y) is not constant, then for any vv' G G* the number of L- 
embeddings with Lu{y) = v and L^'iy) = v' is at most 2“. There are at 
most + |M*| + |M°|) < lOOcare^ choices for a previous new edge vv', 

so this excludes at most 100 c 3 n^ 2 “ configurations. On the other hand, if 
Lu{y) + Lui{y) is constant, then Lu{y) and Lu'{y) are constrained to lie on a 
basic line; there are at most c'^ 2 ^ choices for vv' by linear boundedness, and 
each such vv' excludes at most 2^“ configurations. The latter estimate also 
applies to the case when one of u or u' is in /. Summing these bounds over 
all uu', we see that fewer than half of the total configurations are excluded. 

Next we fix any edge uu' of the extended configuration, any vv' G G*, and 
estimate the sum over / G of the probability p that a random configuration 
satisfies Lu{y) = v and Lu'{y) = v'. If uu' n / = 0 and Lu{y) + L^'iy) is not 
constant, then p < 2“/(l — 60c)(i(G)^®7^^2^“ for any / G ‘h. There are at 
most choices for /, so the total contribution is at most . 

Otherwise, if Lu{y) + Lu'{y) is constant or one of u or u' is in /, then one 
vertex of / is specified by {Lu{y), Lu'(y))■ For example, writing / = abc, 
in the associated octahedron for az 2 Z 3 , if n = Z 2 and u' = a + Z 2 then 
a is specified by {Lu{y), Lui (y)). Then there are at most 2 c 3 n choices for 
/ (as UM° is C 3 -bounded). For each such / we have a contribution of 
at most 2 ^“/(l — 60 c)(i(G)^® 7 ^® 2 ^“, so again the total contribution is at 
most 2 c 3 (i(G)“^^ 7 “^®. Summing these bounds over all uu' we can estimate 
Eyyi < 100 c 3 (i(G)“^® 7 “^^ = c'g/d. Applying Lemma 2.9 as in the proof of 
Lemma |2.I0[ we deduce the claimed bound on F. 

We also claim that whp there are at most 2c'^2^ far triangles in any basic 
plane II = {z : b ■ z = v}. To see this, we first consider the contribution 
from the template triangles 11* = 11 n T. Since zi + Z 2 + Z 3 = Q \s linearly 
independent or contradictory to the defining equation of Ff we have |n*| < 
2“. Summing < c'^j4: over an edge vv' in each triangle of 11*, by Lemma 


2.9 whp n contains at most c'^2^ template triangles. Now fix any far non¬ 


template triangle f of the extended configuration, any g G K 3 {G*), and 
estimate the sum over / G d> of the probability p that a random configuration 
satisfies Lf>{y) = g. If /'n/ = 0 then as f is non-template it determines the 
configuration, so p < 1/(1 — 60 c)ci(G)'^® 7 ^^ 2 ^“, giving a total contribution of 
at most 2 c 3 d(G)“^^ 7 “^^n“^. Otherwise, f determines one of the associated 
octahedra, so specifies one vertex of /, for example, writing / = abc, if 
f = {z 2 ,a + Z 2 ,a + Z 3 } then a is specified. Then there are at most 2 c 3 n 
choices for /; for each such / we have p < 2 “/(1 — 60 c)(i(G)^^ 7 ^® 2 ^“, so again 
the total contribution is at most 2 c 3 (i(G)“^® 7 “^^n“^. Summing over f and 
applying Lemma |2.9| as in the proof of Lemma 2.10[ we deduce the claimed 
bound on 11. This completes the analysis of Phase 1. 

To analyse Phase II, we first estimate the number of choices for an ex¬ 
tended configuration on a pair ff. This can be described by the linear 
extension (Q^j,,L), where 12 is as above, we have variables 2 ; = ( 2 ;i, 2 ; 2 ), 
which we also use to label the vertices oiIlff> \ (fuf), we define = x for 
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all X G , and define Lx for all other x as required for the triangles mQff 
other than / and f' to be octahedral. Th e lin ear forms are distinct, as / and 
f are not template triangles. By Lemma 4.2 there are (l±60c)d(G)^^7^®2^“ 


valid Llfft. Again, the precise exponents of d{G) and 7 are not important 
for the argument, but are straightforward to calculate: e.g. 7 appears with 
exponent 20, as 14/// and each of the 7 associated octahedra adds 3 new 
vertices to the extension, but 4 vertices in the associated octahedra belong 
to the base of the extension, being the third vertex of the template triangle 
containing an edge in / or f other than e. 

We claim that whp T' is linearly C4/2-bounded. The argument is very 
similar to that given above for T. Assuming this bound on the current 
graph of new edges, one can show similarly to before that fewer than half of 
the total configurations are excluded. We also need to estimate the sum over 
/ G of the probability that a random configuration satisfies L„(y) = v 
and Lui{y) = v', for any uu' in the extended configuration and vv' G G*. 
For most choices for uu' the required bound follows as before, but there is 
an additional case, namely when uu' n (/ U /') = 0 and Lu{y) + L^'iy) is 
constant, it may be that no vertex of / U /' is specified by {Lu{y), L^j/(?/)), 
but instead some pair (not e) is constrained to lie on a basic line. For 
example, writing / = abc and f = abc', li u = b + zi and u' = c + zi then 
{Lu{y), Ly^i(y)) specifies b + c, but not b or c. In this case, we use the fact 
that F is linearly c^-bounded to see that there are at most c'^2°' choices for 
//'. Each such //' contributes at most 2“/(l — 60c)(i(G')^^7^°2^“, giving 
a total contribution of at most 2c'^d{G)~^^'y~‘^^. Summing over all uu' we 


estimate E,, 


< 100cfjd(G)-537-20 


so the claim follows from Lemma 2.9 


Finally, M 2 satisfies the conditions (P1-P3): indeed, (PI) holds by defi¬ 
nition of the extended configurations and random greedy algorithms, (P2) 
holds as A C F U F', and (P3) holds as whp T has at most C 4 n triangles 
in any basic plane: this holds for the new triangles in this algorithm by the 
same argument as for 4>', and we may include the far triangles from the 
previous algorithm in this estimate. □ 


This completes the proof of Completion, and so of Theorem 2.1, under 
the assumption that G is (c, 16)-typical. The following modification proves 
the theorem under the assumption that G is c-typical. It is well-known 
that G is c^/^*^-regular (say) in the ‘Szemeredi sense’ (see e.g. [ 6 l Theorem 
2.2]). For S C V{G) with jS*] < 16, say that S is good if | na;e 5 G(3:)| = 
(1 ± |S'|c^/^'^'^)d(G)l'^ln, otherwise bad. As G is c-typical, there are no bad 
sets of size 1 or 2. By regularity, for A: < 15, any good fc-se t is contained 
in at most bad {k + l)-sets. The proof of Lemma 


2.5 


still applies if 

we assume that all subsets of S are good. Thus we can avoid using bad sets 
with negligible changes to the calculations. 


5. Random triangle removal 


In this section we sketch a proof of Theorem 2.8 by describing how to 


apply the analysis of random greedy hypergraph matching by Bennett and 
Bohman [T] (we choose this for simplicity, but there are several other alter¬ 
native approaches). 
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Consider the random triangle removal algorithm starting with G rather 
than Kn- The intuition is that after i steps the remaining graph G* should 
look like a random subgraph of G where edges are retained independently 
with probability p = 1 — 3t, where t = i/\G\. For any e G G let Te{i) denote 
the number of triangles of G* containing e and Q{i) denote the total number 
of triangles of G*. By assumption, re(0) = (1 ± b)D for all e G G, where 
D = d{G)‘^n. Note also that <5(0) = (1 ± b)\G\D/3. 

We will show that for 0 < z < (1 — 6^/‘^)|G|/3 whp 

Q{i) = \G\Dp^/3 ± Bq and Te{i) = Dp^ ± for all e G G*, 

where = 2(1 — 3logp)^ 6 |G|Zl and ed = 2(1 — 3\ogp)b‘^/^D. We restrict 
attention to the upper bounds, as the lower bounds are similar. A convenient 
reformulation is to show negativity of shifted variables 

Q^{i) = Q{i)-\G\Dp^/3 - Bq and (i) = Te{i) - Dp^ - Bd- 

This follows whp from martingale concentration inequalities (e.g. m) after 
we verify the following ‘trend’ and ‘boundedness’ hypotheses, supposing that 
the required estimates for Q and Tg hold at previous steps (i.e. i < t, where 
the stopping time r is the first step where any of the required estimates fails, 
or oo); we use primes to denote conditional expectation given the history of 
the process. 

Trend hypothBsis: If Q~^{i) > —b\G\D then + 1) < if 

T+(i) > -62/3/) then E'T+(z + 1) < T+{i). 

BoundsduBss hypothBsis: + 1) — (5'''(z))2|G|plogn < (6|G|T1)2, and 

— 0 < T+(z + 1) — Tg+(i) < 9 with 9 < 0/10 and 00|G|plogn < {b'^/^D)‘^. 

We start with the boundedness hypothesis, which holds with room to 
spare. Indeed, |(5’'“(z+l)—(5’''(z)| = 0(n), so ((5’'“(z+l)—Q’''(z))2|G|plogn = 
O(n^logn), whereas {b\G\D)‘^ > b'^d{G)^n^/4: > n®'®/4. Also, for any b € G^ 
we have —1 < Te{i + 1) — Te(i) < 0. The change in Dp^ is 0{Dp/\G\) and 
in Bd is 0(62/3 Z)/p|G|), so we can take 0 = 2 and 9 = 0{p + b"^/^/p)D/\G\. 
Then d0|G|plogn = 0{p^ + b‘^/^)D\ogn = O(nlogn), whereas (62/3/1)2 = 
6^/3d(G)4n2 > n^-^. 

Next consider the trend hypothesis for Q. Conditional on the required 
estimates at step z, if Q^{i) > —6|G|Z1 we have 

E'[Q(z + 1) - Q{i)\ = -Q{i)-^ Y. + <^(1) 

esG* 

= -9Q(z)/|G|p ± 2|G|pe2/g(i) + 0(1) 

< —3T>p2 — 9eg/|G|p + 9bD/p + (6 + o(l))e^/T)/»2. 

Also, as Bq is increasing, the one-step change in —\G\Dp^/3 — Bq is at most 
(1 -I- 0 (l/|G|)) 3 T>p 2 . As p > 6 ^/“^, we deduce 

E'[Q+(z + 1) - Q+(z)] < -9eg/|G|p + 9bD/p + 7e^/T>p2 
< —18(1 — 3logp)‘^bD/p + 9bD/p + 28(1 — 3 log/»)26^/^T>/p2 < Q. 
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Now consider the trend hypothesis for Tg. Conditional on the required 
estimates at step i, if T^{i) > we have 

E'[rg(i + 1) - Te(i)] = -Q{i)-^T^{i)2{Dp^ ± erf + 0(1)) 

< -2Q{i)-\Dp^ + erf - h^/^D){Dp^ - erf - 0(1)) 

< -Q{Dp^f/\G\Dp^ + (6 + o{l)){b^/^D){Dp^)/\G\Dp^ + 0(eq/|G|V). 


Also, the one-step change in —Dp^ is at most (1 -|- 0(l/|G|))60p/|G|, and 
in —erf is at most —(1 -|- 0{l/\G\p))lS}p‘/^D/\G\p. We deduce 

E'[rg+(i + 1) - T^{i)] < -lQh^/^D/\G\p + 0{\og^p)bD/\G\p‘^ < 0. 


Thus the required estimates hold, i.e. for 0 < z < (1 — 6^/'^)|G|/3 whp 
Q{i) = |G|Op^/3 ± Bq and Te{i) = Dp^ ± erf for all e G G®. 

Now w e ap ply the same method to deduce the boundedness conclusion of 


Theorem 


2.1 


For 0 < z < (1 — 6^/‘^)|G|/3 we show whp 
|G*(z;)| = p|G(z;)| ± for any vertex v, 


where e.i; = 2h^/^d{G)n. The upper bound G+(z) := |G*(z;)|—p|G(z;)|—e.^ < 0 
will follow whp after we verify the following two conditions. 


Trend hypothesis: If G+(z) > —b^/^d{G)n then E'G+(z -1- 1) < G+(z). 


Boundedness hypothesis: —0 < G^(z -|- 1) — G^(z) < 9 with 6 < 0/10 
and 00|G|plogrz < (6^/^d(G)n)^. 


For the boundedness hypothesis, we can take 0 = 2 and 9 = 3|G(z;)|/|G|, 
so 00|G|plogn = 6p|G(z;)| logn = O(nlogn), whereas {b^^^d{G)n)'^ > 
b^n^ > . For the trend hypothesis, if G/'(z) > —b^^^d{G)n we have 

E'[|G®+i(t^)|-|G®(z;)|] = - rg(z)/g(z) 

eitiSeSG® 

< -(p|G(z;)| + e, - b^/^d{G)n){Dp^ - erf)/(|G|T»pV3 + e^) 

< -3|G(z;)|/|G| - 3(e, - b^/^d{G)n)/\G\p + O(erf)^ + 0(e,) 

The one-step change in —p|G(z;)| — e„ is at most 3|G(n)|/|G|, so 

pn E'[G+(i + 1) - G+(z)] < -66^/3 ^ o{edlDp) + 0{eq/\G\Dp^) < 0, 

as Cd/Dp = 0{b’^/^p~^ logp), eq/\G\Dp^ = 0{bp~^ log^p) and p > 6^/^. 

Thus, letting N be the set of triangles removed during the process, whp 
L = G \ UN is 6^/^-bounded. 


6. The number of designs 

In this section we generalise Theorem |2.2| to estimate the number of de¬ 
signs. We start by describing the results of [5] on the existence of designs. 
Let T> be a g-graph (i.e. a set of subsets of size q) of a set X of size n. We say 
that H is a design with parameters (n, q, r. A) if every subset of X of size r 
belongs to exactly A elements of S. Note that ifq' = 3,r = 2,A = l then D 
is a Steiner Triple System. The necessary divisibility conditions generalise 
in a straightforward way: if D exists then {fZD must divide A (!/Zi) for every 
0<z<r — l;to see this, hx any z-subset / of A and consider the sets in 
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D that contain I. In [5] we proved the ‘Existence Conjecture’, which states 
that these divisibility conditions are also sufficient for the existence of a de¬ 
sign with parameters (n, q, r, A), assuming q, r, A are fixed and n > no{q, r, A) 


is large. We will generalise Theorem 2.2 as follows. 

Theorem 6.1. For any q,r,X there is no such that if n > no and (^I-) | 
A(”~*) for all 0 < i < r — 1, writing Q = (^) and N = (^Zl) > number 
D{n, q, r, A) of designs with parameters (n, q, r, A) satisfies 

D{n, q, r, A) = A!“ O {{X/e)^-^N + G). 


The proof of Theorem |6.1| follows that of Theorem |2.2t the lower bound 
generalises the argument given earlier in this paper, and the upper bound 
generalises that of Linial and Luria [8] . 

We start with the lower bound. In the same way as a Steiner Triple System 
can be viewed as a triangle decomposition of Kn, we can view a design with 
parameters (n, q, r, A) as a K^-decomposition of XKlf, where denotes the 
complete r-graph on q vertices and XK^ denotes the multi(hyper)graph in 
Klf in which each edge has multiplicity A. To generalise Theorem |2.1[ we 
first need to define the divisibility and typicality conditions for general r- 
graphs (we will omit multiplicities in the definitions, as we do not need them 
for our application here). 

For S C V{G), the neighbourhood G{S) is the (r — |5|)-graph {/ C 
V{G) \ 5 : / U S' G G}. We say that G is K^-divisible if (^I*) divides |G'(e)| 
for any i-set e C V{G), for all 0 < f < r. We say that G is {c,h)-typical if 
there is some p > 0 such that for any set ^ of (r — l)-subsets of V{G) with 
\A\ < h we have |n 5 gAG'(S)| = (1 ± c)p^^^n. 

When d{G) 


Now we can state the r-graph generalisation of Theorem 


2.1 


is at least a constant independent of n this follows from [5l Theorem 1.4]; 
the same proof shows that d{G) can decay polynomially in n. 

Theorem 6.2. For any q > r > 1 there are co,a G (0,1) and h,£,no G N 
so that if n > no and G is a K^-divisible (c, h)-typical r-graph on n vertices 
with d{G) > n~°‘ and c < cod{GY then G has a K^-decomposition. 

In the proof of Theorem |6.1| it is more convenient to count designs together 
with a choice for each e G Kf of a bijection between the A copies of e in XKf 
and the A sets of the design containing e; we will refer to such a structure as 
an edge-labelled design with parameters (n, q, r. A) and denote their number 

by D* (n, q,r, X). As D*{n,q,r, X) = Xl\r) D{n, q, r. A), it suffices to show 
D*{n, q, r, A) = {{X/e)^-^N + o{N)Y‘^~' G). 


Proof of Theorem 6.1| . For the lower bound we start by setting aside 
a random subgraph R of K'f in which each edge is chosen with probability 
n-alQ 


where we can apply Theorem 


6.2 


with a and £, and we suppose 
£ <C 1/a (i.e. parameters are 


without loss of generality that Q h 
chosen from left to right to satisfy various inequalities below). 

Next we will consider the random greedy matching process in the following 
auxiliary hypergraph A. We let V{A) consist of A copies of each edge of 
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\ R and A — 1 copies of each edge of R. We let E{A) consist of all 
(^)-sets in V{A) that are edge-sets of a copy of in 

In the random greedy matching process, we start with A, and at each 
step we select a uniformly random edge e of the current hypergraph, then 
delete all vertices of e (and all incident edges) to obtain the hypergraph for 
the next step. We stop the process when fewer than vertices of A 

remain and let L denote the multigraph in consisting of the remaining 
vertices of A. Similarly to the previous section, one can adapt the analysis of 
Bennett and Bohman [T] to show that whp (i) when pX ((() edges remain the 
number of choices for the next edge of the process is (1 ± , 


and (ii) \L{e)\ < for any (r — l)-set e C [n]. 

Next we apply a random greedy algorithm to sequentially cover each edge 
of L by a copy of in which all other edges are in R] as usual, at each 
step we make a uniformly random choice subject to not using any previously 
covered edge. The proof of Lemma 2.10| generalises to show that whp the 
algorithm does not abort, and writing S for the subgraph of R covered by 
the algorithm, whp |S'(e)| < for any (r — l)-set e C [n]. By Chernoff 

bounds whp R is /i)-typical. Also whp \R\ > ^ so lA := R\S 

is (con“^“,/i)-typical with |i?'| > Furthermore, R! was obtained from 

AAT^ by deleting edge-disjoint copies of AT^, and AiF^ is iF^-divisible by 
assumption, so H is iL^-divisible. Therefore TA has a AT^-decomposition by 
Combining this with the previous choices of AT^’s we have 


Theorem 6.2 


constructed an edge-labelled design with parameters (n, (?, r. A) (the vertices 
of A specify the edge-labelling). 

Now we may calculate similarly to the proof of Theorem |2.2[ Writing 
m for the number of steps in the random greedy matching process, and 
p{i) = 1 — — iQ{^) ^ for the approximate density at the Ah step, the 

logarithm of the number of choices is 


Li = ^(log(p(i)«A'3(9)±2n-VQ) = AQ'^ ((!) (log(A«(p) - Q ± 
i=l 


Also, for any fixed design, the logarithm of the number of times it is counted 
is at most 

m 

A2 = Y,^og{p{i)XQ-\^)) = XQ-\^){\og{XQ-\^)) - l±n-“/2«). 

i=l 

As ^ = N + o{N) we deduce 

log D*{n, q, r, A) > XQ-^{^) logHX/ef-^N + o{N)). 

For the upper bound in Theorem |6.1| we apply the Entropy Method, 
following the argument of Linial and Luria [8] for Steiner Triple Systems 
(see their paper for motivation and exposition of the method). We let X 
be a uniformly random edge-labelled design with parameters (n, q, r, A), and 
consider the entropy H{X) = — Y1,d ~ loglP(^ = D) (using natural 
logarithms). We have D*{n, q, r, A) = so it suffices to estimate H{X). 

We consider the labelled edges of AA^ in a uniformly random order: it 
is convenient to select p = (pe) G [0,1]^^" uniformly at random, and to 
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proceed by decreasing order of /ie- At each step, when we consider e, we 
reveal the block Xg, of X that contains e and is assigned to e according to 
the edge-labelling. Conditional on /r, we have 

H{X) = Y, H{Xg I (Xe. : /r(e') > //(e))). 

e&\Kr 

We estimate H{Xg \ {Xg/ : /i(e') > lJ.{e))) < logNg, where Ng is the size 
of the support of the random variable Xg \ {Xgi : ii{e') > //(e)), i.e. Ng = 1 
if e is a labelled edge of Xgi for some e' that precedes e, otherwise Ng is the 
number of choices of a labelled g-set / containing e (i.e. we fix labellings of 
the other Q — 1 edges in /) such that for each such labelled edge e' in /, no 
labelled edge of the block Xgi precedes e. 

Next we condition on X, fix e, and write Fg for the event that ^{e') < //(e) 
for all F G Xg. We estimate Elog where the expectation is with respect 
to //, and we suppress the A-conditioning in our notation. We have 

Elog = E(E[logA,^ I //e]) = E(//?-iE[logA,^ | ^g,Fg\) 

and by Jensen’s inequality E[logA^ | //e, Ag] < logE[Ai‘ | fig,Fg\. 

Now we write K[Ne \ fig,Fg\ = 1 -|- I f^e,Fg], where the sum is 

over all labelled g-sets / 7 ^ Xg containing e, and Aj is the event that for 
each labelled edge e' in /, no labelled edge of the block Xgi precedes e. Note 
that there are only 0{N/n) such / with \ f Ci f\ > r for some block f' of 
X. For any other such / we have E[Aj | ^g,Fg] = We deduce 

E[Ae^ I ^^g,Fg\ = + 0{N/n). 

Finally, 

logD*{n,q,r,X) = H{X)< ^ E^l^-HogE[N^ \ fig,Fg] 

e£XK- 

= A(") [ \og{t^^^-^h^-^N) + 0{l/n) dt. 

’’ Jo 

For any A,B,C > 0 we have log(C't'®) dt = A~^\ogC — A~‘^B. 

Setting A = Q, B = Q{Q — 1), C = A*^“^A we deduce 

log A*(n,g,r. A) <AQ-i((!)log((A/e)«-iA + 0 (A)). □ 

7. Concluding remarks 

Although we have proved (and generalised) Wilson’s conjecture, one may 
still ask for more precise estimates (even an asymptotic formula) for the 
number of Steiner Triple Systems, and more generally designs. Such results 
have been obtained by Kuperberg, Lovett and Peled [7] , using very different 
methods to ours, but only for designs within a certain range of parameters. 
One open case of particular interest (recently drawn to my attention by Ron 
Peled) is the problem of estimating the number G{n,d) of d-regular graphs 
on n vertices. These may be viewed as designs with parameters (n, 2,1, d), 
for which our methods give G{n,d) = d\~'^{dn/e + o(dn))'^”'/^. Much more 
precise results have been obtained by McKay and Wormald, including as¬ 
ymptotic enumeration for d = a;(n/ logn) (see [9j) and d = o{^/n) (see [TO]!: 
their conjecture in |9] regarding a general asymptotic formula remains open. 
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